Multi-fidelity Bandit Optimisation∗
نویسنده
چکیده
In many scientific and engineering applications, we are tasked with the optimisation of an expensive to evaluate black box function. Traditional methods for this problem assume just the availability of this single function. However, in many cases, cheap approximations may be available. For example, in optimal policy search in robotics, the expensive real world behaviour of a robot can be approximated by cheap computer simulations. We formalise this task as a multifidelity bandit problem where the target function and its approximations are sampled from a Gaussian process. We develop a method based on upper confidence bound (UCB) techniques and prove that it uses the approximations to eliminate low function value regions and uses the expensive evaluations mostly in a small region containing the optimum. For instance, in the above robot control example, our method would use the simulations to quickly eliminate suboptimal policies, while reserving real world trials for a small set of promising candidates. I will begin this talk with reviews on UCB methods for bandits and Gaussian processes before proceeding to multi-fidelity optimisation. A preliminary version of the paper is available at www.cs.cmu.edu/ ̃kkandasa/pubs/kandasamy16mfbo.pdf. ∗Internal Talk, Gatsby Unit, July 12, 2016.
منابع مشابه
Multi-fidelity Gaussian Process Bandit Optimisation
In many scientific and engineering applications, we are tasked with the optimisation of an expensive to evaluate black box function f . Traditional settings for this problem assume just the availability of this single function. However, in many cases, cheap approximations to f may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer s...
متن کاملGaussian Process Bandit Optimisation with Multi-fidelity Evaluations
In many scientific and engineering applications, we are tasked with the optimisation of an expensive to evaluate black box function f . Traditional methods for this problem assume just the availability of this single function. However, in many cases, cheap approximations to f may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer si...
متن کاملThesis Proposal Tuning Hyper-parameters without Grad Students: Scaling up Bandit Optimisation
Many scientific and engineering tasks can be cast as bandit optimisation problems, where we need to sequentially evaluate a noisy black box function with the goal of finding its optimum. Typically, each function evaluation incurs a large computational or economic cost, and we need to keep the number of evaluations to a minimum. Some applications include tuning the hyper-parameters of machine le...
متن کاملGaussian Process Optimisation with Multi-fidelity Evaluations
In many scientific and engineering applications, we are tasked with the optimisation of an expensive to evaluate black box function f . Traditional methods for this problem assume just the availability of this single function. However, in many cases, cheap approximations to f may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer si...
متن کاملMulti-fidelity Bayesian Optimisation with Continuous Approximations
Bandit methods for black-box optimisation, such as Bayesian optimisation, are used in a variety of applications including hyper-parameter tuning and experiment design. Recently, multifidelity methods have garnered considerable attention since function evaluations have become increasingly expensive in such applications. Multifidelity methods use cheap approximations to the function of interest t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016